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Description 

[0001] The present invention relates to a plasmid vector of Rhodococcus , a new constitutive promoter an expression 
vector containing said promoter, microorganisms transformed with the expression vector and their use in the production 
5 of proteins, r 

[0002] Bacteria of the Rhodococcus genus are of wide interest in the field of the biode gradation and biotransformation 
of organic compounds (Warhurst and Fewson, 1994, Crit. Rev. Biotechnol., 14:29-73). 

[0003J Processes are known, for example, which use strains of Rhodococcus for the selective removal of orqanic 
sulfur from fossil fuels (U.S. 5,358,870, U.S. 5,132,21 9, PCT/US92/01868, EP-445896) and for the production of en- 
zymes involved in the production of acrylamides (Kobayashi et a)., 1992, Tends Biotechnol., 10:402-40B) carboxvlic 
acids, L-aminoacids (W09804733) and enantiomorphs of chiral compounds (U.S. 5,672,504). 

[0004J The main restricting factor in optimizing biocatalysis processes which use these bacteria is however the lack 
of suitable genetic instruments. 

[0005] This term refers to expression vectors in Rhodococcus which: 

are present in cells in multiple copies; 

- are steadily maintained inside the cells without costiy selective agents (for example antfoiotics), which considerably 
influence the economic convenience of an industrial process; and 

- contain a strong promoter, i.e. capable of allowing an effective expression of a gene, or a strong constitutive pro- 
moter which does not require the use of inductors and is not susceptible to repressors. 

[0006] In fact, a limit in the removal of organic sulfur from fossil fuels with strains of Rhodococcus which produce the 
sulfate^ 3 * 10 COrnP ' eX ' iS dUe t0 the P resence - "Pstream the corresponding genes, of a promoter greatly inhibited by 

[0007] To overcome this drawback, the genes encoding this enzymaticcomplex were placed in Rhodococcus vectors 
under the control of constitutive heterologous promoters such as that of the gene for resistance to chloramphenicol of 
^gdg^u^fagjgigng (Plddington, C.S. et at., 1995, Appl. and Env. Microbiol., 61,2,: 468-475) or of the gene sacB 
of B.subt.l.s (Dems-Larose. C. et al, 1998, Appt. and Env. Microbiol., 64,11 ■ 4363-4367) and of the gene for resistance 
to Kanamycin of ^coli (Serbolisca et a)., Appl. Microbiol. Biotechnol. 1999, 52:122-126). The maintenance of the 
vectors, however, required the presence of a selective agent in the cuiture medium and, furthermore, the expression 
of the sox operon under the control of the promoter sac B proved to be very low (Lau. P. et al. 1 999 ACS Fuel Chem.: 

[0008] It has now been found that the disadvantages of the known art described above can be overcome by the 
expression vector of the present invention. y 

[0009] In accordance with this, a first objective of the present invention relates to the cloning plasmid vector pSM843 
stable tn Rhodococcus . w r H 

[0010] A further objective of the present invention reiates to a new constitutive promoter of Rhodococcus capable 
of Greeting the expression of a homologous or heterologous gene with a high efficiency and characterized by the 
sequence SEQ. fD. Nr, 2. y 

[001 1] Another objective of the present invention relates to an expression vector in Rhodococcus which comprises 
said constitutive promoter. — K 

[0012] A further objective of thepresent Invention relates to astrain of Rhodococcus transformed with said expression 

[001 3] Yet another objective of the present invention relates to a process for the production of homologous or het- 
erologous proteins in Rhodococcus bacteria transformed with said expression vector 

[0014] Additional objectives of the present invention will appear evident from the following description and examples. 
Brief description of the figures 

[0015] Figure 1 : shows the restriction map of the 1 1 kb plasmid pSM841 . 
[0016] Figure 2 : shows the restriction map of the 7.3 kb plasmid pSM843. 
[0017] Figure 3 : shows the restriction map of the E.coli pfasmid pSM839. 

gjJJ 47 ElgyiEi: shows the restriction map of the piasmid pSM846. Figure's : shows the restriction map of the plasmid 
[0019] In particular, the expression vector according to the present invention comprises: 

(a) the rep genes, ORF81 and trbA which encode proteins involved in replication in Rhodococcu s- 

(b) a gene called parA whose product is necessary for maintaining the plasmid in the absence of selective pressure 
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and is characterized by the sequence SEQ ID, Nr. 1 ; 

(c) a constitutive promoter Df Rhodococcus having the sequence SEQ. ID. Nr. 2; 

(d) a multiple cloning site downstream the promoter; and 

(e) at least one gene which encodes a genetic marker selected, for example, from genes of the cad operon (SEQ 
ID, Nr. 3), which provide resistance to cadmium or genes which encode resistance to an antibiotic. 

[0020J The expression vector also contains the replication origin in E.coli and can therefore be used as shuttle vector 
in Rhodococcus and E.coli . 

[0021] The expression vector, indicated hereunder as pSM846, was obtained by: 

(1 ) construction of the cloning vector pSM843; 

(2) isolation of a constitutive promoter of Rhodococcus ; and 

(3) insertion of said constitutive promoter into the vector pSM843. 

[0022] The plasmid vector pSMB43 was prepared by reducing the dimensions of the 130 Kb plasmid of Rhodococcus 
s£. DS7 containing the sox operon, the genes which confer resistance to cadmium, and those for resistance to arsenic 
(Margarit et aL, 1 997 and Serbolisca et al. B 1 999). This reduction was effected by means of the following strategy: 

(a) search for plasmids deriving from the deletion of the 130 Kb plasmid following transformation of a recipient 
20 strain and isolation of a 22 kb plasmid; 

(b) digestion of said plasmid with suitable restriction enzymes, selMigase and isolation of the 11 kb plasmid 
pSM841; 

(c) characterization of the plasmid obtained in (b) and 

(d) construction of the 7.3 kb plasmid pSM843 containing at least one genetic marker and the genes parA rep 
and trbA respectively necessary for stability and replication in Rhodococcus . 

[0023] The stability of the vectors obtained was controlled after each reduction so as to select only those maintained 
in at least 90% of the cells of the host strain for at least 30^40 generations, in complete medium and in the absence of 
selective pressure. 

[0024] Research on constitutive promoters inside the chromosome of a strain of Rhodococcus was effected forthe 
construction of the expression vector, using a new method, which is included in the scope of the present invention 
[0025] This method is based on the observation that strains of Rhodococcus have the capacity of integrating at 
random fragments of foreign DNA in their chromosome, without the necessity for a sequence homology higher than 3 
bp between the donor DNA and that of the host. The integration effectiveness was estimated at about 1 o*-i o 3 colonies 
55 per \ig of DNA in Rhodococcus transformation experiments. 
[0026] The method consists in: 

(i) transforming a strain of Rhodococcus directly with a gene reporter without its promoter or with a multicopy 
plasmid of E.coli containing said gene and linearized upstream the gene reporter; 

(ii) selecting the clones which express said gene, i.e. the clones which have integrated the gene reporter in their 
chromosome, downstream a promoter sequence; 

(iii) digesting the chromosomal DNA of the selected clones with restriction enzymes which cut upstream and down- 
stream the gene; 

(iv) amplifying the DNA obtained in step (iii); and 

(v) sequencing the promoter upstream the gene reporter. 

[0027] Gene reporter refers to a fragment of DNA which encodes a product that allows the selection of the clones 
which express it. Examples of gene reporters useful for this purpose can be selected from those which encode resist- 
ance to antibiotics or heavy metals or enzymes such as XylE or the same Sox proteins. 

[0028] With respect to the techniques currently used, this method is much more efficient and rapid as it does not 
require the preparation of genome banks with fragments of chromosomal DNA upstream the gene reporter This method 
can be applied to all microorganisms which, like Rhodococcus , are capable of integrating at random fragments of 
foreign DNA in their chromosome without the necessity for a high sequence homology between the donor DNA and 
that of the host 

[0029] With this method, aconstitutive promoter was identified of a gene of Rhodococcus having the sequence SEQ 
ID Nr. 2. This promoter was inserted into the plasmid vector psMS43, obtaining the expression vector pSM846 which 
can be used for the production of proteins of interest in Rhodococcus . 

[0030] The segregational and structural stability of the expression vector pSM846 in strains of Rhodococcus was 
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determined by operating as described in Example 4. The results demonstrated that the vector is maintained in over 
90% of the cellular population even after subsequent culture passages without selective agents, thus showinq a hiah 
segregations! stability. ^ y 

10031 J Furthermore, analysis of the plasmids isolated from the transformed strains showed that, even after various 
generations in liquid culture, this plasmid remains structurally stable in different strains of Rhodococcus with from 4 to 
8 copies per cell. ■ — 

[0032] The expression vector of the present invention can be used for the expression of genes which encode proteins 
of interest such as, for example, enzymes involved in the selective removal of organic sulfur from fossil fuels (SoxA 
SoxB, SoxC, SoxD), the production of L-aminoacids {amidase, aspartase), the production of enantiomorphs of chira! 
compounds (epoxide hydrolase, ketoestero reductase), the production of carboxyfic acids (nrtriiase), etc. 
[0033] The expression vectorcan be used in various species of Rhodococcus and in phyJogenetically similar bacteria 
such as Gordona and Nooardja . 

[0034] The capacity of the promoter of directing the constitutive expression of the gene put under its control was 
verified by positioning downstream said promoter, the sox operon isolated according to what is described by Serbolisca 
L., de Ferra, F., Margarit, L, Appl. Microbiol. BiotechnoL, 1999, 52:122-126. 
[0035] The results obtained demonstrated that this promoter allows an effective and constitutive expression of the 
SoxA, SoxB and SoxC proteins in the presence of inorganic sulfur in the culture medium. 

[0036] The strains containing the vectors pSM843, pSM846 and pSM847 were deposited attheCentraalbureau Voor 
Schimmei-cuitures as Rhodococcus SMV 112, SMV 113 and SMV 114, where they received the respective numbers 
20 CBS 1 02445, 1 02446 and 1 02447. 

[0037] The foilowing were used in the experiments described hereunder: 

- the 1 30 Kb plasmid containing the sox operon which encodes enzymes responsible for the conversion of drben- 
zothtophene (DTB) to 2-hydroxybrphenyl, and the genes which encode resistance to cadmium and arsenic Said 

25 plasmid was isolated from the strain Rhodococcus sp. DS7 (Margarit, !. et ai., 1997, 9 th Proceedings of the Inter- 

national Conference on Coal Science: 1579-1582); 

- the plasmid pSM789, incapable of replicating itself in Rhodococcus , obtained by inserting Into the plasmid of E 
coN pUC18, the 4549 bp fragment Hindili-EcorRI which contains the sox operon without promoter isolated from 
Rhodococcus sp. DS7 (Margarit, I. et al., 1997, 9* Proceedings of the international Conference on Coal Science- 
s'? 1 579-1 582) : 

[0038] The Rhodococcus sp. DS7 strain was previously ciassified as Arthrobacter on the basis of microbiological 
test (D'Addano 1996 Proceedings of the Symposium AAA Biotechnology. Shejbal, E. and Ferrara: 139-149) Subse- 
quently, a comparison of the DNA 16 S sequence with that of the data banks indicated that the strain belonged to the 
35 Rhodococcus genus. The most consistent homologies were found with Rho dococcus er ith re us and Rhodococcus ervth- 
ropolis . 1 — 

- the Rhodococcus DS~2 strain, obtained from Rhodococcus sp. DS7, incapable of desulfurizing as it does not have 
the 130 kb plasmid (Margarit, I et al., 1997, 9^ Proceedings of the International Conference on Coai Science* 

*o 1579-1582). 

[0039] The following examples are illustrative but do not limit the scope of the invention itself. 
EXAMPLE 1 
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Construction of the plasmid vector pSM843 

[0040] Electrocompetent cells of Rhodococcus DS-2 (100 jtf) (BIORAD Gene Pulser TM, 400 ohm 25 uF) were 
transformed with the 130 kb plasmid <1 00 ng). The recombinant clones were selected on piates of LB-agar containing 
0.5 mM of CdCI 2 and subsequently plated on minimum medium (1 0 g/l of KH 2 P0 4 pH 7.4 2 5 g/l of NH 4 CI O 2 g/l of 
MgCI 2 -6H 2 0, 0.02 g/f of CaCI 2 , 0.01 g/l of FeCi 3 , 0.005 g/l of MnCI 2 *2H 2 0, 0,003 g/l of ZnCI 2 , 0.0009 g/i of CuCL-2H ? 0 
agarose 0.8%) containing dibenzothiophene (DBT) as sole sulfur source. One of the cadmium-resistant clones proved 
to be incapable of desulfurizing DBT The pfasmid DNA was extracted from this clone, which, after being tested on 
agarose gel at 0.7%, showed dimensions of about 1 00 kb, 

[0041] Operating as described above, from the transformation of DS-2 with the 100 kb plasmid, a 35 kb plasmid was 
obtained which no longer conferred resistance to arsenic, and from this a 22 kb plasmid. 

[0042J The plasmid stability was checked for the three vectors in order to select only those maintained in the host 
strain for at least 30-40 generations in the absence of selective pressure. 
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[0043] The 22 kb plasmid {1 u.g) was digested with 4 U of the restriction enzyme Sex Af (Boehringer), in 1 0 jjj of The 
salts supplied by the producer, incubating for 60 minutes at 37°C and 15 minutes at 70°C. The linearized plasmid (2 
was self-ligated in 10 jil of buffer, in the presence of 1 U of T4 DNA ligase, for 16 hours at 16*C. 1 u.1 of the ligase 
mixture was used to transform DS-2 electrocompetent cells. A 18.7 kb plasmid was isolated from one of the clones 
5 containing plasmids with reduced dimensions. This plasmid was digested with the enzyme EcoRf obtaining a 15.8 kb 
vector which was further reduced by digestion with the enzymes Sspl and Dral, which leave truncated ends. 
[0044] The resulting 11 kb plasmid, called pSM841 (figure 1 ), was sequenced and showed the presence of: 

the cad operon which confers resistance to cadmium and having the sequence SEQ. ID. Nr. 3: 
to - rep genes, ORF81 and trbA encoding proteins involved in replication described by Denis-Larose, C. et aJ., 1998 

Appl. and Env. Microbiol., 64, 11 : 4363-4367. A 15 bp sequence was identified between these two genes, which! 

by homology with other sequences, may correspond to the replication origin of the plasmid; and 
- a gene parA whose sequence (SEQ ID Nr. 1 ) shows a partial homology with that of genes involved in the partioning 

of the plasmids which takes place during the cellular division. 2 repeated sequences each of 24 bp, were identified 
*s upstream this gene. 

[0045] To confirm the importance of the DNA region containing the parA gene in maintaining pSM841 , an 8 8 kb 
plasmid was constructed without this region. The results demonstrated a reduction of over 50% in the plasmid stability. 
[0046] Once the genes indispensable for maintaining the plasmid pSM841 had been identified, a 7.3 kb vector was 
20 constructed, containing: 

1 - a fragment Drai-EcoRi of 271 7 bp comprising the cad operon; 

2- a fragment Sspl-Ncol of 2716 bp comprising the genes trb A and rep A; 

3- a fragment EcoRI-Ncol of 1857 bp comprising the gene par A and a multiple cloning site (MCS). 

[0047] These fragments were isolated from the plasmid pSM841 by means of amplification and then ligated in the 
presence of T4 DNA ligase at 16°C for a night. 

[0048] The resulting piasrnid, called pSM843, has the restriction map indicated in figure 2. 
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Isolation of the constitutive promoter 



[0049] The plasmid pSM789 (5 itg) was digested with 40 Units (U) of the restriction enzyme Hindlll (Boehringer) in 
35 50 \i\ of buffer supplied by the manufacturer. 

[0050] After incubation at 37*C for 60 minutes the denaturation/extraction of the proteins was effected by adding a 
volume of phenol-chloroform (1:1) and a volume of chloroform:isoamyl alcohol (23:1). The DNA was subsequently 
precipitated by adding 5 u.l of a 3 M solution of Na-acetate and 300 fil of ethanol to the aqueous phase and then 
resuspended in 10 uJ of H 2 0. fn this way, an open linear plasmid was obtained exactly upstream the RBS reoion of 
40 the first gene of the sox operon (sox A). 

[0051] The linearized DNA (0.2 ^g) was used to transform 100 uJ of cells of Rhodococcus sp . DS-2 and the trans- 
formants were then selected on minimum medium plates containing 0.04 g/lof DBTand 1%of C 2 H 5 OH as sole sources 
of sulfur and carbon, respectively. " ~" " ^ 

[0052] 1 00 clones capable of growing were isolated using DBT As the plasmid pSM789 was incapable of replicating 
itself in Rhodococcus, the transformants obtained {about 1 0* per u.g of DNA used) had to contain the sox operon inside 
their chromosome, downstream a promoter which allowed its expression. 

[0053] In order to determine whether the integration took place in different points of the chromosome, the chromo- 
somal DNA was extracted from 20 clones (Current Protocols in Molecular Biology - Vol 1) and digested with 20 U of 
the enzyme Not], for which there is only one binding sequence inside the sox operon. The digested DNA was subjected 
to electrophoresis on agarose gel 0.8%, visualized by colouring with EtBr 0.5% and transferred onto nylon (Nyfon 
Membranes, positively charged - Boehringer) by means of Southern Blot (Sambrook, J., Fritsch, E.F., Maniatis, T. 1 989 
Molecular cloning: a laboratory manual 2nd edn. Cold Spring Harbor, NY). 

[0054] The DNA was then hybridized with a probe whose nucleotidic sequence corresponded to a fragment of 770 
bp of the sox C gene f using the non-radioactive method provided by the DIG SYSTEM- kit Boehringer. The hybridization 
55 reaction was effected at 68° C. 

[0055] Of the clones tested a single band was observed, with a different molecular weight for each clone, which 

indicated a single integration event of the plasmid pSM789 in different points of the chromosome. 

[0056] In order to verify if these clones were capable of desulfurizing DBT in the presence of inorganic sulfur, they 
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were plated on mm.mum medium conta.n.ng both DBT and MgS0 4 -7H 2 O (0.20 g/l), in parallel with the Rhodococcus 
IP, DS7 strain. After 12 hours, upon exposing the plate to UV rays at 254 nm, a fluorescent halo was o bserved for all 
the recombinant clones, whereas no halo was visible forthe native strain. This indicated that the activity of the isolated 
promoters was not repressed by sulfate. The clone which had the most consistent halo was called SMV11 0 



EXAMPLE 3 

Characterization of the promoter contained in SMV110 



[0057] 1 ug of chromosomal DNA of the clone SMV11 0 was digested with 20 U of the enzyme Clai, in 1 0 ul of buffer 
(Boehringer), at 37»C for 1 6 hours. After deactivation of the enzymes at 70»C for 1 5 minutes, 2 uJ of the digested DNA 
w^as treated with 1 U of the enzyme T4 DNA ligase in 1 0 pi of buffer (Boehringer), incubating at 1 6»C for 16 hours 

E 1 qUOt (1 ftl> ° f ' igaSe miXtUre W3S USSd t0 transf orm cells of E coli XL1 " Blue made eiectrocompetent 
(Dower, W.J. et at. 1 988 Nucleic Acids Research, 1 B: 6127-6145). irocomperem 

15 [ ? 0S9 L ™! transformants were selected on plates of agarized LB medium containing 1 00 fig/mf of Ampicillin The 
plasmrd DNA extracted from one of the clones thus obtained was analyzed by restriction analysis. The resuits indicated 
that the plasmid consisted of a 4.5 kb fragment, deriving from the chromosomal DNA of DS-2, and the plasmid pSM789 
The map of this new plasmid, called pSM839, is showed in figure 3. 

so K 0< S ^ ^ 5 k ^ frag r m9nt W3S then am P lified with ^e Polymerase Chain Reaction (PCR) technique, (I eung D 
20 W., Chen, E., Goeddel, D.V., 1989 Technique- a journai methods in ceil and molecular biology 1 Nr usina 
the following pair of oiigonucieotides: 9 



25 1) 5' CAGTCACGAC GTTGTAAAAC GA 3' (FORWARD) 



2) 5' TGCATTTGTC GTTGTTGAGT 3' {REVERSE) 

[0061 ] The amplification was carried out in a DNA Thermal Cycler 480 apparatus (Perkin-Elmer Cetus) using 1 00 id 
m,xture c °nta'»ing: 5 ng of plasmid DNA, 60 pmoles of the two oligonucleotides, 200 u.M dNTPs (dATP 
dGTP, dTTP, dCTP) and 1 U of Taq polymerase (Boehringer), in the buffer recommended by the manufacturer After 
denaturation for 2 minutes at 94°C, the cyclic program was started, which comprises: 1 minute at 98'C 1 minute at 
60 C and 3 minutes at 72°C for a total of 25 cycles, followed by 8 minutes at 72*C (final extension) 
[0062] The amplified 4.5 kb fragment was then sequenced by means of a DNA sequencer ABI 373 (Perkin Elmer) 
The sequence and possible open reading frames were compared with those present in DNA and protein banks using 
he research motor BLAST supplied by NCBI (Altscul, S.F., Madden, Th, Schaffer, A., Zhang, J., Zhang Z Miller W 
Lipman.D., 1997 Nucleic Acids Research 25: 3398-3402). ! ' ' 

[0063] The results showed that the integration of P SM789 in the chromosomal DNA of the DS-2 strain had t*ken 
place inside a gene which encodes a protein homologous to some endoglucanases and amylases of other rmu'wr- 
ganisms. 

[0064] It was therefore deduced that in the clone SMV110, the expression of the sox operon was controlled by the 
promoter of said gene. In order to characterize this promoter, the exact transcription starting point of the sox operon 
was determined by primer extension using the 5' RACE System kit (BRL) ~ 

[0065] The results indicated that the start of the transcription of the sox operon took place at a distance of 622 bp 

from 5 of the operon and that the promoter was therefore situated immediately upstream said region 

[0066] Analysis of the sequence upstream the transcription starting site revealed the presence of the presumed -1 0 

region. 

EXAMPLE 4 

Construction of the expression vector pSM846 

D 5S?« A ^ 100 L bp fra 9 m<?nt containing the constitutive promoter was amplified by PCR starting from the plasmid 
pSMS39 using the following oligonucleotides: 9 piasmia 
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(i) 5'GATCGGCC GGG ATCC &CGAGT GTT 3' {forward} 

BamHI 

(ii) 5'ACCCAACACTGAGCTGTTAA CGGCCGGAGC GGCCGATGCA 



HindlXI Hpal Sfil 
TT TTA GGTGA TGCCCGGG 3' (reverse) • 

« Nsi 

[0068] The amplified fragment was digested with the enzymes Hindlll and BamHI, eluted from agarose and inserted 
into the plasmid pUC18, The resulting plasmid was digested with the enzymes Afllll-Hpal and ligated to pSM843, 
previously digested with Afllli and NcoJ (which form compatible ends with the former) in the presence of T4 DNA ligase 
at 16°C for a night. A new vector shuttle of E.coli - Rhodococcus , called pSM846, was thus obtained, containing the 
promoter followed by an MCS for the constitutive expression of proteins in Rhodococcus . The strain of Rhodococcus 
DS-2 containing the pfasmid pSM846, whose map is showed in figure 4, was called SM V1 12. 

25 EXAMPLES 

[0069] In order to verify the stability of the plasmid pSM846 in the transformed strains of Rhodococcus after a pro~ 
(onged period of culture in the absence of selective pressure, three independent clones of the strain SMV112 were 
grown at 30*0, 200 rpm for 16 hours in 100 ml flasks, containing 20 mi of LB medium (Bacto Triptone 10 q/l veast 
30 extracts g/i, NaC1 10 g/l). ' 

[0070] The three cultures (0.1 ml) were used to inoculate a further 20 ml of the same mediums, and the new cultures 
were grown at 30°C, 200 rpm for 16 hours. This procedure was repeated a further 3 times, the cellular growth being 
followed as an increase of the optic density measured at 600 nm (O.D. 600). 

[0071] At the end of each growth, aliquots of cultures were removed, suitably diluted and then plated on LB agar 
medium. The plates were incubated at 30*0 for 1 6 hours and the colonies were then counted (CFU/ml). In this way it 
was determined that there were at least 35 generations at the end of the experiment. 

[0072] In order to determine the percentage of celis which had maintained the plasmid for at least 35 generations, 
and were therefore resistant to cadmium, 100 single colonies deriving from the plating by dilution, after the 5th growth 
of the three cultures, were placed on LB plates containing CdCig at a concentration of 0.5 mM. 

[0073] it was observed that the plasmid was maintained in over 90% of the cellular population, thus showing a con- 
siderable segregational stability even after subsequent passages in culture in the absence of selective pressure, 
[0074] In order to verify the structural stability of the plasmid pSM846, at the end of the 5th growth passage, the 
plasmid DNA was extracted from an aliquot of the cultures of 1 2 clones, using the method described in Serboiisca et 
al. (1999). The DNA obtained we re digested with various restriction enzymes for which there are single or double sites 
in the plasmid p$M846 and analyzed by means of electrophoresis on agarose gei for 2 hours at 90 V, 90 mA; the gels 
were coloured with Ethidium Bromide 1 mg/mL The visualization of the gel on a UV light trans-illuminator showed the 
expected bands according to the restriction map of the plasmid, demonstrating that pSM846 was structurally stable in 
Rhodococcus for several generations in the absence of selective pressure. 

50 EXAMPLE 6 

Determination of the number of copies of pSM846 per cell 



[0075] The average number of copies of the plasmid pSM846 per cell was estimated by means of quantitative PCR 
on preparations of total DNA from cells of Rhodococcus DS-2 containing said plasmid. The method is based on the 
determination of the quantity of DNA obtained by amplification of a fragment of the plasmid. This quantity is directly 
correlated to the concentration of the plasmid itself, used as a mould in the amplification reaction. 
[0076] The total DNA was prepared according to the method illustrated in Current Protocols in Molecular Biology - 
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Ausubel et al., Ed. Wiley Interscience Section 2.4.1 with the following modification: to obtain a complete cellular lysis 
the centrifuged cells from 1.5 ml of culture were resuspended in 567 microliters of TE (TrisHCI 10 mM EDTA 1 mM 
P H 8) containing lysozyme 50 ug/ml and were incubated at 37"C for20 minutes before adding SDS and proteinase K 
according to protocol. 

[0077] After a treatment of 10 minutes with RNasi, the concentration of total DNA was estimated on agarose qel and 
the samples were diluted to 1 00 pg of DNA per microliter. 

10078] PCR reactions were effected on the DNA thus obtained, using two oligonucleotides (primers) which appear 
inside the operon for resistance to cadmium and having the following sequence: 

" 5' TGGCCCGGCC GGAATTGATG GAC 3' {primer Cd4) 

and 

- 5' GCCGACGGCC GCGATCGTGA TCAG 3' (primer Cdl7) . 

[0079] The standardization was effected by means of a second PCR reaction on the portion of chromosomal DNA 
encoding RNA 16S of Rhodococcus DS7 using oligonucleotides specific for this sequence. 
[0080] The number of copies of plasmid was estimated on both the strain containing the pfasmid pSM846 and on a 
strain containing a single copy of the genes for resistance to cadmium integrated in the chromosome and on the strain 
DS7. 

25 p)081] The amplification reactions were carried out according to the instructions of the Syber Green kit of Perkin 

[0082] The amplified DNA was marked with the Syber Green fluorescent marker and quantified with a PE Applied 
Biosystems GeneAmp 5700 instrument. Each sample was triply amplified and the fluorescence values were compared 
with calibration curves included in each experiment for each pair of primers. The quantity of DNA obtained by amplifyinq 
with the pnmers specific for the plasmid was then corrected for the quantity of chromosomal DNA amplified with the 
primers specific for the RNA 1 6S genes. 

[0083] On comparing the three strains tested it was established tbatthe strain DS7 contains an average of 2-3 copies 
of the 130 kb plasmid per cell, whereas cells of Rhodococcus DS-2 transformed with pSM846 contain from 4 to 8 
copies of plasmid per cell. 
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EXAMPLE 7 



Construction of the plasmid pSMB47 



[0084] The objective of the experiment was to obtain a strain containing several copies of sox genes underthe control 
of the constitutive promoter identified. 

[0085] The fragment of DNA containing the promoter identified in example 3, was amplified starting from the plasmid 
pSM839 by means of PCR, using the following oligonucleotides: 

- 5' ATTCGAGAGT GCATATGCGG AAC 3' (FORWARD) 

- 5' CCATTTCTTC CAAGCTTCCG CCG 3' (REVERSE) 

which pair with the sequence SEQ. ID. Nr. 2, Into which the restriction sites Ndel and Hindlll were introduced 
[0086] About 500 ng of the DNA obtained from the amplification reaction were digested with 4 U of the enzymes 
Ndel and Hindlll for 60 minutes at 37"C and purified on Nusieve agarose gel at 1 .5% (FMC products) Parallellv 1 ua 
of the plasmid pSM789 was digested with 4 U of the same enzymes for 60 minutes at 37°C 50 ng of vector and 25 
ng of insert purified from agarose were then ligated in the presence of 1 U of the enzyme T4 ligase. The ligase mixture 
was subsequently used to transform competent ceils of E.coli XL-blue. 

[0087] 500 ng of the piasmid extracted from one of the clones of E^coli containing the desired insert and 500 nq of 
the plasmid P SM843 were digested separately with 4 U of the restriction enzyme Sspl (New England Biolabs). The 
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reactions were carried out at 37°C for 60 minutes and at 70°C for 1 0 minutes. 

[0088] 150 ng of the linear plasmid pSM843 were then iigated with 50 ng of the plasmid of E»coli in 10 |xf of buffer 
(Boehringer), in the presence of 1 U of T4 DNA ligase, incubating for 1 6 hours at 1 6°C. 1 uJ of the ligase mixture was 
used to transform eiectrocompetent cells of Rhodococcus DS-2. The cells were plated on minimum medium in the 
5 presence of DBT and cultivated at 37°C for 4-5 days. 

[0089] The plasmid DNA was extracted from one of the colonies thus obtained, according Lo the method described 
in Serbolisca et aL (1999, Appl. Env. Microbiol. 52:122-126) and subjected to restriction analysis with enzymes for 
which there are sites in the starting plasmlds. The vector thus obtained, containing the sox operon under the control 
of the constitutive promoter, was called pSMB47 (Figure 5) and the strain containing it SMV114. 

10 

EXAMPLE 8 

[0090] The strain SMV114 was cultivated in parallel with the strain Rhodococcus DS7 in minimum medium plus DBT 
with and without 0.20 g/l of MgS0^7H 2 0. 
is [0091] Electrophoresis and Western blot analysis of the soluble proteins extracted from the bacterial cultures showed 
that: 

1 . when the strains are grown in DBT without sulfate, SMV114 expresses higher quantities of Sox A, B and C 
proteins than DS7; 

20 2. in the presence of inorganic sulfate, the strain Rhodococcus DS7 does not express the Sox enzymes, whereas 

SMV114 expresses the same quantity of enzymes with respect to growth without sulfate. 

[0092] The desulfurizing activity of the strain SMV114 was measured on resting cells using DBT as substrate and 
compared with that of the strain Rhodococcus sp. DS7. From a pre-inocuium in minimum medium, two inocula were 
25 effected for each strain in 100 ml of the same medium, without and with inorganic sulfur (0.20 g/l of MgS0 4 -7H 2 0, 80 
mM). The growths were effected for 20 hours at30°C in flasks with breakwater protection. The starting optical density, 
measured at 660 nm, was OD 0.25 for the cultures in MgS0 4 + DBT and 0.4 for the cultures with DBT alone. The dry 
weight was triply determined, from 5 ml of centrifuged culture at 5000-6000 mm for 10 minutes, frozen at -80°C and 
lyophilized. 

30 [0093] For the measurement of the desulfurizing activity, 1 0 ml of culture were removed and centrifuged at 5000-6000 
rpm for 1 0 minutes, at room temperature. The cells were resuspended in 9.75 ml of Tris-HCI 20 mM pH 7 t introduced 
into 50 ml flasks and incubated in a stirred bath at 30°C for 5 minutes. After adding 250 uJ of DBT 40 mM to the 
suspension, a first sampling (1 ml) was effected immediately, which was extracted with 2 ml of ethyl acetate, to deter- 
mine the product background at time zero. Subsequent samples were taken after 1 and 2 hours. 

35 [0094] 20 u.l of the clarified organic phase were analyzed by H PLC on a chromatographic column in C1 8 Vydac type 
TP218-5418 inverse phase, with a program comprising a flow of 1 mi/min for 15 minutes in 80% of acetonitrile. The 
specific activity, expressed as mg of 2HBP produced per hour per gram of dry weight was calculated from the area 
corresponding to the product 2-hydroxybiphenyl (2 HBP) and taking in account the dry weight. 

[0095] The activity values obtained without in organic sulfur corresponded to 5 +/- 0.6 mg/h.g both for the native strain 
40 and for SMV114. When cultivated in the presence of 0.2 g/1 of MgS0 4 , the Rhodococcus DS7 strain did not express 
any desulfurizing activity, whereas the SMV114 strain maintained its activity. 

EXAMPLE 9 

45 [0096] The possibility of using the plasmid pSM847 in other species of Rhodococcus was aJso examined. 1 00 ng of 
the plasmid were used to transform eiectrocompetent cells of strains of Rhodococcus erythropolis , Rhodococcus rho- 
dochrous and Rhodococcus opacus . In all cases colonies capable of desulfurizing DBT were obtained, which contained 
the plasmid having the estimated dimensions. Also in this case, at least 90% of the clones maintained the plasmid for 
about 40 generations in the absence of selective pressure. 

so 
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SEQUENCE LISTING 
SEQ ID NO: 1 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 2290 base pairs 
STRANDEDNESS : Single 



TOPOLOGY : 


Linear 










ATCGATTTTT 


CCGCCGGCGG 


CGCCGCCGCA 


TCCCACGCTG 


CCGTGAGTCC 


50 


GTCGAAAGCT 


CGCAGCATCG ACGGGTGGTA 


CTTACGCAAC 


GCTGCCTGGC 


100 


TGAACCCGGC 


GTTGAGGATG ATGTGCGCCT 


TCTGCCAGTT 


GGGTTCGGAG 


150 


TTGAAAGCGG 


TGAAAAGGCC 


GTCCTCCGCG 


ACTGCCCGCA 


GCTTTCGGAT 


200 


CGGCAGACCC 


ACCGATTTCG 


CCCACAATGC 


CTCGTTGTTG 


ACCTCGGCGA 


250 


CCAGCTCCAC 


CCCGGAGACC 


ACGGTCAGTC 


GGTGATTGAT 


GATCTTGCGC 


300 


TCGAAGATCG 


GCCCCAGCCT 


CGAAGCCATG 


ACCATTTCCT 


TCTGGACAGG 


350 


CTTCACCGAG 


TCGACGCTGA 


GCAAGTCCCC 


CAACACAGGC 


AGCCGCCAGG 


400 


ACGGCGACGG 


AATCGACTCG 


AACTCAAGGA 


CTGGTTTGAC 


CGCGACCTCG 


450 


CTGGTGGGTG 


TCTCACTGCT 


CGGATACGGG 


CACTTGTCGG 


AAGACATGGT 


500 


GGACAGTCTT 


GCATCAGTTC 


CGGACTAGAT 


GCCGACTCGA 


CGCAAAATGG 


550 
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15 



20 



GCGATGGAAC GCAAAGCCGC CATCCAGGAT GGAGCAGCGC GCCAAGAACC 600 

ATGCAAACCA GCGCTACTTG CGCAGACAAG CGGAAGTCGA ACGTGTCGTT 650 

GTTCCCCCTC ATCTTCACTT CCCCACTGCT CTACGGAACC TTCTGAGCTG 700 

10 GCATTTTGCA TGATTTCCGA GCCCCGGAAC CAGCTATACA GTTTTACGGC 750 

AAACCGGGAA ACTGGAAAAC CGTATAGACG GCTACGCGGC AAAACGATAT 8 00 

GCTGTCTATA CGGATAGACG GGAAACCGGA TGCCTGGGAA ACTGGCTAAG 8 50 

TGTGTATACG GCTAACTGGC AAGCCGGGAA GCTGTTTTAC AGGTAACTGT 900 

TTTTACAGCT TTCCGGCAAA CCGGGAAACT GTAAAACCGG CAAACTGGGA 950 

ACTGTAAAAC CGGCAAACTG TAAACAAGCG AAAGGGCCCT TCTCGATGAT 1000 

CATCAGTCTC GTCAACACCA AAGGCGGAAC GGCCAAGACG ACGAGCGCGA 1050 

& TCTACCTTGC ACTCGCGTTT CATAATCGGG GGAGGAAGGT TGTCGTCCTC 1100 

GACTTGGATA AACAGGGTTC AGCAACTGAC TGGGCTGACC GCGCCACAGA 1150 

GGCCGGAGAT CCACTCCCGT TCCCAGTGCA TGTCGTGAAC ATGAAACGGC 1200 

TGGTGAAGTA CGCCACCGAT GGCGATGACC AGGTAGTAAT CATCGACACC 1250 

CCGCCCGGTG ACGGGCAAGT TAT CGACGCT GCAATCGGGG CGTCCAACTT 1300 

CGTCATAATG CCCACGGCGG CTACAGGACT CGACACCGCC AGGGTCTGGG 13 50 

AGACGCTGCC CTCGGTGCAG GGCCGCCTTC CCTACGGAAT CCTCATTACC 1400 

TCCGCACGTC TCGGAACGAA CCTGCTTGAA GATGCCAAGG CAGCGTTCGA 1450 

CAGCAACGAC GCAGCCCGAT TCGACACCGT CATCCCCATG CGCGAGCGCA 1500 

TCCGGTCAAC ATTCGGTACT ACCCCCAAGC ACGACGAAGG GTATTCCGAC 1550 

GTCGTCGACG AGATCACAGA GGCGCTGACA GCATGAGCAT CCCCAAGGCA 1600 

AAGACGACAC CGACGCTCGG GCCCCGCAAG TCCGCGCCAC CGATCGCGAC 1650 

GCCCTCAGTG ACGTCCAGCT TCGTCGAACC GGACGCAGAC CGCACGAAAC 17 00 

TCACCGTCCA GATCGACGCT GAACTGCACC GACGCTTCAA AGCTGCTGTT 1750 

GCAGGAAGCG GCAAGAAGAT GCGCGACGTG GTCGAGGAAA TGATCGAGCA 1800 
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GTGGACCGAC GCGAACAGTG GTCGCTCATA ACTCGGTGAC GCTGTTACGC 1850 

AAAACCAAAG GGAGCCTCGC CGGTTGCACC CGACGAGGCT CCAGACCGGG 1900 

AAACCCCTGG AAGGAAGAAA CCCAGTCATG ACCGACTTTA GCCTTCGGGC 1950 

AGCTCGTATT CAGCTAGGCG CATTGCTCGC CGCACTGGCA CTCGCGATCG 2000 

TCATTGCGCT CGGTATCACC ACCGGCGCAT TCGTACTTTC CTTCGCCGTG 2050 

CAACGCGACC TTGCACGGCA AGCACTGATC CCCGAACACC TGACCTGGAT 2100 

CTTCCCGGCG ATTGTCGACA GCGCTATCCT CGGCGCCACG ATCGCCATCG 2150 

TCATCATCAG CAAGCTCAAC ATGAACAAGC GCGACAGAGG CTTCTACATC 2200 

GCACTCGCCG TCAGCGTTGT CGTGATCAGC ATCCTCGGAA ACGCGTACCA 2250 
CGCCTATCAC GCAGCAATCG CCGCGCAGGA GTCGATCGAT 
SEQ ID NO: 2 

SEQUENCE TYPE: Nucleotide 
30 SEQUENCE LENGTH: 1355 base pairs 

STRANDEDNESS : Single 
TOPOLOGY: Linear 
PROPERTIES : Promoter 

AT CGATCGGC CGGGATCGAC GAGTGTTGCC ATTTCACCGA GCACTTCGCC 

ACCGCGGATC TGACGGCGCC GATCTCGGTG ATGGAGTCCT GGTCGGCGCT 100 

TCCACCTGTG GTCTCGAGGT CGACGACGAC AAACGTCGTT TCGCTCAGCG 150 

GTGTGTCCAG CTCGTCGAAG CTCAGTTGCC GCCCGACGTC GAACTTCTCC 200 

CCGTCGGAGA ACGGGGACGG AGACGGAAAT TCTCGGGCCG AGGCGGGACT 250 

CACACCGCAG ACAGTAGAAG GACCCCCCGA CACGAATTCG AGAGTGCATA 300 

CGCGGAACCC GGCTCACTCC GACTTCCATT GGGAAAGCAA CGGATTGACC 350 

CCCGACAAAA GTACGAGAGA CGAGGATCAC ACTTGCGGTG GGAACTCGCG 4 00 

AATGCGGACT CCTACCTACT TACCGGTAGG GCGATTCGTG AGGGTGTTGC 450 
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TACGGCGATC AGTCGCCGAA ACCGCCTGAT TGATCGAAAC GGGTTCTAGT 5 00 

5 

TTCAAGTAAT CGGCGTTGAA CTGCATGGAA GATCGGATTG TCGGGTTTGG 550 
TGACCCTTGC CAGTGGGTAG CAAGCGCAGG TTTGCGGGCC AGGTGAGCCT 500 
to CGCCGAACAG ACCTGTTGTT ATCTCTTCGT GACCCGTCCG GCGTATCGCG 650 

GGCGCGGCAG CATGATTTCC GGCGAATCGG GCGCGATTGT CACGAAGCGG 700 
TCGACATCGT CGCGATCATT TCGTAACCTT CCTGAGACCT AACGAAGTCT 750 
TGCCGGACCA AACAGGACCG GCATCGTCAG GAACCGCCTA ATCGGGATCT 800 
CCGCGAGGAA CCCGAACCGG GAACCCAAGT TTCCACTGGG GTGAATCCCG 850 
CCTGGGGTGC AGCACGCATG ATCGCTGCGG ACCACGCGGG TAGGGCTGAT 900 
CTTCCCAGCC CGAACCCGTC AGCTAACTCG GTCGGCGGAT GAATGGAAGA 950 

AATGGAGCAC CCCTTAAGTG GCGTCACAGA GTTTCAAGCG TTCAGCGCAG 10 00 

TTGGCAGTAG CCGGCGCGCT CGCAGTCGGA GCATTTGCTG CAACCGCTGC 1050 

30 ACCTGGCTCG GCAGACCCCA TCACGATCCC CGGCGTCGGC ACCTTCGAGG 1100 

TTCCGGGCGC TTCTATTCCT CAGTTGCCGG TCATCCCGGG CATCACCGAC 1150 

ATCGCACCGG CCGCTCCGGC CGCTCCCATC TCCAGTGTTG GTGAGCAGGC 12 00 

AGTTCGCGCC GCAGAGAGCA AGCTGGGCTC CCCGTACGTG TACGGCGCAT 1250 

CGGGCCCGGA CGCATTCGAC TGCTCCGGTC TGGTCCAGTG GGCATACAAG 13 00 

CAGGCTGGTC TGAACCTGCC TCGCACGAGC TACGACCAGG CCGCAGCCTA 13 50 
AATTT 
45 1355 

SEQ ID NO: 3 
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SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 2806 base pairs 
STRANDEDNESS : Single 
TOPOLOGY: Linear 
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PROPERTIES : operon 



AGATGCCACT 


GCGCCATTCC 


TGGTGAGGTA CGTCGGCCGA CCACCGTTCG 


50 


GCTGGCGTAC 


CACATCCACT 


TTCAAGTCCT GCACATCCGC ACCTTTTTTA 


100 


AATCGCCAAC 


TATCGCTATC 


ATCGTTATAT 


GACGATAAAT 


AAAGAGTCGG 


150 


GGTGCCTCGC 


CGCAACCGGC 


AGCGGGTCCA 


GCTTGGATGC 


GGCAGTGGCC 


200 


CTGTTCCACA 


GTCTCTCGGA 


CGGCACCCGA 


TTGTCGATCG 


TGCGCCGTCT 


250 


CGCCGAAGGA 


GAGGCACGGG 


TTGCGGATTT 


GATCGGCGAA 


CTCGGCCTCG 


300 


CCCAGTCGAC 


GGTGTCCGCA 


CATGTCGCAT 


GCCTACGTGA 


TTGCGGTTTG 


350 


GTCGACGGCA 


GACCGGAAGG 


TCGGCAGGTG 


TTCTACTCGC 


TGGCCCGGCC 


400 


GGAATTGATG 


GACCTACTCG 


CCTCGGCGGA 


GACGCTCCTT 


GCCGCGACAG 


450 


GGAACGCGGT 


TGC CCTGTGC 


CCGAATTACG 


GCACCGACAT 


CGGAGATAGC 


500 


CGTGAGTGAC 


GCGTGCGGCT 


GCGGCCACGA 


CGAACCCCGT 


GCCGAGGGCG 


550 


AGGAAGAACA 


CGGGCCCGAA AAGTGGTGGC AGGTTACCGA GATCCGGGCG 


600 


GCTGCAGCTG 


CGGGCGTGCT 


GCTGAT CGCG 


GCCCTGACAG 


TCGGGCTGGC 


650 


CGGCGGACCT 


GATGCGCTCG 


GGATCGGC CT 


CGAAGCGGGC 


GCGCTGATCA 


700 


TTGCCGGCTA 


CACCTTCGTA 


CCGTCCACCC 


TCACACGCCT 


GGCCAAGGGC 


750 


AAAATCGGGG 


TCGGCACCCT 


GATGACGATC 


GCGGCCGTCG 


GCGCGGTACT 


800 


GCTCGGCGAG 


GTCGGCGAAG 


CAGCCATGCT 


CGCATTCTTG 


TTTGCGATCA 


850 


GCGAGGGACT 


CGAGGAGTAC 


GCGGTCACCC 


GCACCCGCCG 


TGGCCTGCGC 


900 


GCGTTACTGT 


CCCTGGTCCC 


GGACACCGCG 


ACAGTGCTCC 


GCGACGGCCG 


950 


GGAGGAAACC 


GTCCCACCCT 


CGGACCTCGA 


GCTCGGTGAG 


GCCATGATCG 


1000 


TCAAGCCCGG 


GGAGCGGATC 


GCCACCGACG 


GTGTCATTCG 


CGCCGGCCGC 


1050 


ACCGCCCTGG 


ACACCTCCGC 


GATCACCGGA 


GAATCGGTGC 


CCGTCGAAGC 


1100 


CGGTCCCGGC 


GACGAGGTAT 


TCGCCGGGTC 


GATCAACGGC 


ACCGGCGTCC 


1150 


TCGAAGTCGA GGTCAGCGCA GAGGCGCAGG ACAATTCGCT GGCGAAGATC 


1200 
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GTGCGGATCG TCGAAGCTGA GCAGTCCCGC 
CGCCGACAAG ATCGCCAAAC CGCTGGTTCC 
CGGTTATCGC CGCGGTCGGC AGCGTGCTCG 
GAACGAGCCC TGGTGGTACT GGTCGCCGCG 
CGCGATACCC GTCACGGTCG TCGCCGCAAT 
GTGTCCTGGT CAAGGGCGGC GCAGCCCTCG 
ACCATCGCAT TGGACAAGAC CGGCACCCTC 
GATCGACGTC GCGACCGCGA ACGGCGCCAA 
TGGCCGCAGC GTTGGAGGCA CGCAGCGAGC 
CTCGCCGCCG TCGAGGACTA CACACCCGCT 
CGGTGCAGGC CTGACCGGCT ACATCGACGG 
GACCAGGCTG GATCGACCCG GGGCCACTGA 
CAGCACGCCG GCGCCACTGC GGTC CTGATC 
CGGGGCGGTC GCCGTCCGCG ACGAACTCCG 
TCACCGAACT ACGCCGCGGC GGCTACACCG 
AACGAGCGCA CTGCCCGCGC TCTGGCCGCC 
GCACGCCGAT CTGCGCCCGG AGGACAAGGC 
GTGCGAGCCG ACCGACGGCG ATGGTCGGGG 
GCCCTGGCCA CCGCCGATCT GGGTATCGCG 
CGTCGCCATC GAGACCGCCG ATGTCGCTTT 
ATCTTCCTCA GGCTCTGCAG CACGCGCGAC 
CAGAACGTCG GCCTGTCTTT GGCGATCATC 
TCTGTTCGGT GTACTCGGAT TGGCTGCGGT 
CCGAGATCGT CGTCATCGCC AACGGGGTGC 
CTGGCTGCTG TAC CGCAGTC GACCAGGGCG 



AAGGGCGAGG 
GGGCGTAATG 
GTGATCCGCT 
TCCCCGTGCG 
CGGCGCAGCA 
AAGCGCTCGG 
ACCCGCAACC 
CCGCGGTGAC 
ACCCGCTCGC 
GATGACGCGG 
CATCCCCGTC 
CCGGGGATAT 
GAACGCGCCG 
CCCGGAAGCA 
TCGCGATGCT 
GATGTCGGCA 
GCGCATTGTG 
ACGGTGTCAA 
ATGGGTGCGA 
GATGGGCGAG 
GATCGCGGTC 
ACCGTTCTGA 
CGTGCTCGTC 
GCGCCGGGCG 
TCCGAGCCGG 



CGCAACGCCT 1250 

ATCGTCGCCG 1300 

GGTGTGGATC 1350 

CGCTGGCCAT 1400 

TCCAAGCTCG 14 50 

CAGGATCCGC 1500 

AACCCGCCGT 1550 

GTTCTCGCGG 1600 

TCGCGCGATT 1650 

ACGCAGTCAT 17 00 

CGG CTCGGTC 1750 

CGAACGAATG 18 00 

GAACCGTGAT 1850 

CGCGAAGTGG 1900 

CACCGGCGAC 1950 

TCGACGACGT 2 000 

GAGACCTTGC 2 050 

CGATGCCCCC 2100 

TGGGAACCGA 2150 

GACCTGCGGC 2200 

GATCATGTTG 2250 

TGCCTCTGGC 23 00 

CACGAGGTCG 23 5 0 

TACCCGAAGT 2400 

CCACGGTAGT 2450 



15 



EP 1 127 943 A2 



10 



15 



20 



25 



30 



35 



40 



50 



ACGGCGATGA CATCTGCCGC CAGCTCCGGG TCATCATCGG TGTCCGTGGT 2500 

CGGAATGCGT CGCGGCCGGG GTGTTCTCGT CATACTCGCA GGTGCCTTGG 2550 

CTGCGGTCGC GCTGCTCCTC GACCCGGTGG CACGTGGCGC GCTGTCCGGT 2600 

GGGCAGGTAG TCGATTTCGG GGTCTTGCAA CTGCGATTGG CCTACAACAG 2 650 

CGGTGTCGCG TTCAGTCTCG GTGACCAGCT CCCCACAGTC GTCGTTCTCG 2700 

CCGGCACCGC TGCCCTCACC GCCGCGATCG GTGTGTTCGC CTGGCGCACA 2750 

GCATCTGAGC GTCCCGTACT CCAGACCATC GGGCTGGCCG CGATCACGGC 2800 

CGGTACC 2806 
SEQ ID NO: 4 



SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 22 base pairs 
STRANDEDNESS : Single 
TOPOLOGY: Linear 
PROPERTIES : Primer 
CAGTCACGAC GTTGTAAAAC GA 
SEQ ID NO: 5 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH; 20 base pairs 
STRANDEDNESS: Single 
45 TOPOLOGY: Linear 

PROPERTIES : Primer 
TGCATTTGTC GTTGTTGAGT 
SEQ ID NO: 6 
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SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 23 base pairs 
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STRAND EDNESS : Single 
TOPOLOGY: Linear 
PROPERTIES : Primer 
GATCGGCCGGG ATCCACGAGT GTT 
SEQ ID NO: 7 

SEQUENCE , TYPE : Nucleot ide 
SEQUENCE LENGTH: 58 base pairs 
S TRANDEDNE S S : Single 
TOPOLOGY; Linear 
PROPERTIES ; Primer 

ACCCAACACT GAGCTGTTAA CGGCCGGAGC GGCCGATGCA 
TTTTAGGTGA TGCCCGGG 
SEQ ID NO: 8 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 23 base pairs 
STRANDEDNES S : Single 
TOPOLOGY : Linear 
PROPERTIES : Primer 
TGGCCCGGCC GGAATTGATG GAC 
SEQ ID NO: 9 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 24 base pairs 
STRANDEDNES S : S ingle 
TOPOLOGY : Linear 
PROPERTIES : Primer 
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GCCGACGGCC GCGATCGTGA TCAG 

SEQ ID NO: 10 

SEQUENCE TYPE: Nucleotide 

SEQUENCE LENGTH: 23 base pairs 

STRANDEDNESS : Single 

TOPOLOGY: Linear 

PROPERTIES: Primer 
ATTCGAGAGT GCATATGCGG AAC 
SEQ ID NO: 11 
SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 23 base pairs 
STRANDEDNESS: Single 
TOPOLOGY: Linear 
PROPERTIES: Primer 
CCATTTCTTC CAAGCTTCCG CCG 



Clafms 

1. A cloning vector pSM843 which comprises: 

(a) the rep genes, ORF81 and trbA which encode proteins involved in replication in Rhodococcus' 

(b) the gene parA having the sequence SEQ. fD. Nr. 1 ; and 1 

(c) at least one gene which encodes a genetic marker selected from the genes of the cad operon, which confer 
resjstance to cadmium, or the genes which encode resistance to an antibiotic. 

2. The vector according to cfaim 1 , deposited with the number CBS 1 02446. 

3. An expression vector which comprises: 



(a) the rep genes, ORF81 and trbA which encode proteins involved in replication in Rhodococcus- 

(b) the gene parA having the sequence SEQ. ID. Nr. 1 ; 1 

(c) a constitutive promoter of Rhodococcus having thesequence SEQ. JD. Nr. 2; 

(d) a multiple cloning site downstreamr the promoter; 

(e) at least one gene which encodes a genetic marker selected from the genes of the cad operon, which confer 
resistance to cadmium, or the genes which encode resistance to an antibiotic; and 

(f) contains the replication origin in E.coli , deposited with the number CBS 1 02445. 
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4. The vector according to claim 3, which comprises downstream the constitutive promoter one or more genes which 
encode a protein of interest. 

5. The expression vector according to claim 4, wherein the proteins are selected from enzymes involved in the se- 
lective removal of organic sulfur from fossil fuels or enzymes involved In the production of L-aminoacids, enanti- 
omorphs of chiral compounds and carbDxylic acids. 

6. The expression vector pSM847 according to claim 4, which comprises downstream the constitutive promoter the 
sox operon which encodes SoxA : SoxB and SoxC enzymes deposited with the number CBS 102447. 

7. The expression vector according to claim 3, obtained by: 



(1 ) construction of the cloning plasmid vector pSM843; 

(2) isolation of a constitutive promoter of Rhodococcus ; and 

15 (3) insertion of said constitutive promoter in the vector pSM843, 

8. A microorganism transformed with the expression vector according to claims 3 to 6, wherein said microorganism 
is selected from Rhodococcus , Gordona and Nocardia . 

20 9. A strain of Rhodococcus transformed with the expression vector pSM847 deposited with the number CBS 1 02447. 

10. A process for the production of homologous or heterologous proteins of interest which comprises cultivating, under 
suitable conditions, a microorganism transformed with the expression vector according to claims 3 to 6. 

25 11 . The process according to ciaim 1 0, wherein the protein is selected from enzymes involved in the selective removal 
of organic sulfur from fossil fuels and in the production of L-aminoacids, enantiomorphs of chira* compounds and 
carboxylic acids. 

12, The process according to claim 1 1 , wherein the proteins are Sox enzymes and the strain is Rhodococcus SMV114 
so CBS 1 02447. 

13. A process for the removal of organic suifur from fossil fuels characterized in that it uses a microorganism selected 
from Rhodococcus , Gordona and Nocardia transformed with the expression vector CBS 102445 containing the 
sox operon downstream the constitutive promoter. 



14. The process according to claim 13, wherein the microorganism is Rhodococcus SMV114 CBS 102447. 



15. A research method for promoters in microorganisms capable of integrating at random fragments of foreign DNA 
in its chromosome without requiring a sequence homology higher than 3 bp between the donor DNA and that of 
40 the host, which consists in: 

(i) transforming said microorganism directly with a gene reporter without its promoter or with a multicopy plas- 
mid of E-cofi containing said gene and linearized upstream the gene reporter; 

(ii) selecting the clones which express said gene, i.e. the clones which have integrated the gene reporter in 
43 their chromosome, downstream a promoter sequence; 

(iii) digesting the chromosomal DNA of the clones selected with restriction enzymes which cut upstream and 
downstream the gene; 

(iv) amplifying the DNA obtained in step (iii); and 

(v) sequencing the promoter upstream the gene reporter. 



16. The method according to claim 15, wherein the microorganism is a strain of Rhodococcus . 

17. The method according to claim 15, wherein the gene reporter is selected from those which encode resistance to 
antibiotics, heavy metals or enzymes such as XylE or Sox proteins, 

18. A constitutive promoter of Rhodococcus characterized by the sequence SEQ. ID. Nr. 2. 
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